147 results found.
Written
Treebank,
Language Type:
Monolingual
Languages:
Czech
Availability:
From Data Center(s)
License:
Size:
None Production Status:
Existing-used
Use:
Parsing and Tagging
-
Paper title:Neural Reranking for Dependency Parsing: An Evaluation
-
Paper track:Long/Syntax: Tagging, Chunking and Parsing
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Bich-Ngoc Do | Prague Dependency Treebank 3.0 (PDT 3.0) | /N |
Documentation:
None
Not Applicable
Contextualsed word embeddings,
Language Type:
Monolingual
Languages:
Ancient Arabic Basque Bokmål Bulgarian Catalan Chinese Church Croatian Czech Danish Dutch English Estonian Finnish French Galician German Greek Hebrew Hindi Hungarian Indonesian Irish Italian Japanese Korean Latin Latvian Norwegian Nynorsk Old Persian Polish Portuguese Romanian Russian Simplified Chinese Slavonic Slovak Slovene Spanish Swedish Turkish Ukrainian Urdu Uyghur Vietnamese
Availability:
Freely Available
License:
none
Size:
18.4 GByte Production Status:
Existing-used
Use:
Parsing and Tagging
-
Paper title:Treebank Embedding Vectors for Out-of-domain Dependency Parsing
-
Paper track:Short/Syntax: Tagging, Chunking and Parsing
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Joachim Wagner | Elmo For Many Languages | /N |
Documentation:
https://www.aclweb.org/anthology/K18-2005/
Written
Corpus,
Language Type:
Multilingual
Languages:
Chinese Czech English Finnish German Latvian Romanian Russian Turkish
Availability:
Freely Available
License:
Size:
3.9 MByte Production Status:
Existing-used
Use:
Evaluation/Validation
-
Paper title:Automatic Machine Translation Evaluation using Source Language Inputs and Cross-lingual Language Model
-
Paper track:Short/Resources and Evaluation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Kosuke Takahashi | WMT18 metrics shared task data | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Monolingual
Languages:
Bengali Czech Dari English Hindi Lao Mandarin Chinese Mesopotamian Arabic Moroccan Arabic North Levantine Arabic Panjabi Persian Polish Pushto Russian Slovak South Levantine Arabic Spanish Standard Arabic Tamil Thai Turkish Ukrainian Urdu
Availability:
From Owner
License:
LDC
Size:
204 hours Production Status:
Existing-used
Use:
Language Identification
-
Paper title:Metric learning loss functions to reduce domain mismatch in the x-vector space for language recognition
-
Paper track:4.1 Language identification and verification, lang/Oral Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Raphaël Duroselle | 2011 NIST Language Recognition Evaluation Test Set | /N |
Documentation:
None
Written
Lexicon,
Language Type:
Monolingual
Languages:
Czech
Availability:
Freely Available
License:
CC BY-NC-SA 4.0
Size:
6760 entries Production Status:
Existing-updated
Use:
syntactic analysis, rule-based generation
-
Paper title:Towards a Semi-Automatic Detection of Reflexive and Reciprocal Constructions and Their Representation in a Valency Lexicon
-
Paper track:Written/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Marketa Lopatkova | Valency Lexicon of Czech Verbs | /N |
Documentation:
Documentation in Czech and English
Speech
Corpus,
Language Type:
Bilingual
Languages:
Czech English
Availability:
License:
Attribution-ShareAlike 3.0 Unported (CC BY-SA 3.0 US)
Size:
7.3 GByte Production Status:
Existing-used
Use:
Meta-data analysis & gender exploration
-
Paper title:Gender Representation in Open Source Speech Resources
-
Paper track:Speech/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Mahault Garnerin | Vystadial | /N |
Documentation:
A proceeding paper in English
Speech
Corpus,
Language Type:
Multilingual
Languages:
Czech English French German
Availability:
Freely Available
License:
Not know yet
Size:
2 hoursProduction Status:
Newly created-in progress
Use:
Language Identification
-
Paper title:Detecting English Speech in the Air Traffic Control Voice Communication
-
Paper track:14.7 Automatic Speech Recognition in Air Traffic M/Poster Presentation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Igor Szoke | ATCO2 ATC dataset | /N |
Documentation:
Not yet
Speech/Written
Corpus,
Language Type:
Multilingual
Languages:
Bulgarian Croatian Czech French German Mandarin Polish Portuguese Spanish Thai Turkish
Availability:
From Data Center(s)
License:
ELRA
Size:
18.7 GByteProduction Status:
Existing-used
Use:
Speech Recognition/Understanding
-
Paper title:Zero-shot Cross-Lingual Phonetic Recognition with External Language Embedding
-
Paper track:8.11 Cross-lingual and multilingual/accent aspects/Poster Presentation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Heting Gao | GlobalPhone | /N |
Documentation:
None
Speech/Written
Corpus,
Language Type:
Multilingual
Languages:
Czech English German
Availability:
Freely Available
License:
CreativeCommons
Size:
10 hoursProduction Status:
Newly created-finished
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Lost in Interpreting: Speech Translation from Source or Interpreter?
-
Paper track:12.1 Spoken machine translation/Oral Presentation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Matúš Žilinec | ESIC | /N |
Documentation:
documentation in English
Speech/Written
Corpus,
Language Type:
Multilingual
Languages:
Basque Belgian Dutch Croatian Czech Galician Greek Hungarian Portuguese Slovak Slovenian Spanish
Availability:
From Owner
License:
Size:
None Production Status:
Existing-used
Use:
Evaluation/Validation
-
Paper title:An Approach to Online Speaker Change Point Detection Using DNNs and WFSTs
-
Paper track:5.4 Speech and audio segmentation/Poster Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Lukas Mateju | COST278 database | /N |
Documentation:
None




